Optimal unambiguous filtering of a quantum state: An instance in mixed state 

discrimination 



in 
o 
o 

(N 

Ph. 
< 



Janos A. Bergou^, Ulrike Herzog^, and Mark Hillery^ 
^Department of Physics, Hunter College, City University of New York, 
695 Park Avenue, New York, NY 10021, USA and 
^ Institut fiir Physik, Humboldt- Universitdt zu Berlin, Newtonstrasse 15, D-124-89 Berlin, Cermany 

(Dated: February 9, 2008) 

Deterministic discrimination of nonorthogonal states is forbidden by quantum measurement the- 
ory. However, if we do not want to succeed all the time, i.e. aUow for inconclusive outcomes to 
occur, then unambiguous discrimination becomes possible with a certain probability of success. A 
variant of the problem is set discrimination: the states are grouped in sets and we want to determine 
to which particular set a given pure input state belongs. We consider here the simplest case, termed 
quantum state filtering, when the N given non-orthogonal states, {IV*!)! ■ • ■ > \'4'n)}, are divided into 
two sets and the first set consists of one state only while the second consists of all of the remaining 
states. We present the derivation of the optimal measurement strategy, in terms of a generalized 
measurement (POVM), to distinguish |i/>i) from the set {|'!/'2), ■ • ■ , I'^iv)} and the corresponding op- 
timal success and failure probabilities. The results, but not the complete derivation, were presented 
previously [Phys. Rev. Lett. 90, 257901 (2003)] as the emphasis there was on appplication of the 
results to novel probabilistic quantum algorithms. We also show that the problem is equivalent to 
the discrimination of a pure state and an arbitrary mixed state. 

PACS numbers: PACS:03.67.-a,03.65.Bz,42.50.-p 
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I. INTRODUCTION 

In quantum information and quantum computing the 
carrier of information is a quantum system and infor- 
mation is encoded in its state. In the simplest case the 
system lives in a two-dimensional Hilbert space and the 
two basis vectors are conveniently asssociated with the 
logical and 1. Such a two- level system is called a qubit. 
However, it is not necessary to restrict our attention to 
qubits; d-dimensional systems, or qudits, can also be used 
to store quantum information. Reading out the informa- 
tion from the quantum system is tantamount to identi- 
fying the state it is in where the state itself might be the 
output of a quantum channel or the result of a quantum 
computation. 

We want to find the optimum measurement that ex- 
tracts information about the state. This problem is dif- 
ferent from the usual textbook measurement as no en- 
semble averaging is involved nor are we interested in the 
average value of some physical observable. Every time 
a system reaches the final step we want to determine 
its state. Since the state of a system is not an observ- 
able in quantum mechanics, this sounds at first as if it 
is an impossible task. There are ways around it, how- 
ever. Quantum processors are designed in such a way 
that their output is a member of a set of known states, 
so we are facing the more modest problem of determining 
which of these states was realized. If the possible target 
states are mutually orthogonal this is an easy task: we 
just set up detectors along the corresponding orthogonal 
directions and determine which one clicks (assuming per- 
fect detectors, of course). However, if the target states 
are not mutually orthogonal the problem is still difficult 
and optimization with respect to some reasonable crite- 



ria leads, in general, to highly nontrivial measurement 
strategies. Finding the optimal measurement strategy is 
the subject of state discrimination. An overview of the 
state-of-the-art in the area of state discrimination can be 
found, for example, in our recent review jj] so here we 
just recall the immediate preliminaries. 

One possible criterion is that no error is permitted, 
i.e. the states have to be discriminated unambiguously. 
Quantum measurement theory tells us that it is im- 
possible to unambiguously discriminate between non- 
orthogonal quantum states with unit probability of suc- 
cess so we have to settle for less. If we don't require that 
we succeed every time, then unambiguous discrimination 
becomes possible. When the attempt fails, an inconclu- 
sive answer is returned. The optimal strategy is the one 
that minimizes the average probability of failure. Inter- 
est in unambiguous state discrimination was renewed by 
the suggestion to use non-orthogonal quantum states in 
certain secure quantum cryptographic protocols, in order 
to establish a secure key. A particularly clear example, 
based on a two-state procedure, was developed by Ben- 
nett 13 . 

In most of the previous work discrimination among all 
members of a set of states was considered. Subsequently, 
we turned our attention to the following variant of the 
problem. Instead of discriminating among all states, we 
ask what happens if we just want to discriminate be- 
tween subsets of them. In this class of problems we know 
that a given system is prepared in one of N known non- 
orthogonal quantum states, but we do not know which 
one. We want to assign the state of this system to one 
or the other of two complementary subsets of the set of 
the N given states where one subset has M elements and 
the other has iV - M (M < N/2). Since the subsets 
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are not mutually orthogonal, the assignment can not be 
done with a 100% probability of success. For the case 
that the assignment is to be performed with minimum 
error, the solution has been found for arbitrary M and 
N under the restriction that the Hilbert space spanned 
by the states is two-dimensional 3] . For the case that the 
assignment is required to be unambiguous, at the expense 
of allowing inconclusive results to occur, the probability 
of which is minimized, the problem has been solved for 
M = l,A^ = 3in (Jj. We refer to either case as quantum 
state filtering, a term that we coined inH , when M = 1 
and > 3. The solution presented in Q can be general- 
ized in a straightforward manner to arbitrary N . In our 
recent work we presented the exact analytical solution, 
but not its derivation, of the unambiguous quantum state 
filtering problem - the case of M = 1 and N arbitrary, 
with no restriction on the states - and employed it to 
develop a novel quantum algorithm Q . 

In this paper we fill in the gaps and derive the solution 
that we used in j^. The paper is organized as follows. 
In Section II, based on simple but rigorous arguments, 
we derive the main analytical solution to the optimal 
POVM problem. It invokes Neumark's theorem in order 
to develop a physical implementation of a generalized 
measurement. In Section III, we investigate the region of 
validity of the POVM solution and show that outside this 
region standard von Neumann projective measurements 
can be used to perform optimal unambiguous discrimi- 
nation. In Section IV, we connect the problem of quan- 
tum state filtering to the unambiguous discrimination of 
mixed quantum states and show that our solution can be 
viewed as the discrimination of a pure state and an ar- 
bitrary mixed state. In Section V we give an alternative 
derivation of the optimal measurement which is based 
on considering the geometry of the Hilbert space, and it 
is closer in spirit to the standard approach to POVMs. 
Of course, the results are identical to those of the pre- 
vious sections. A brief discussion of recent experimental 
progress and conclusions are given in Section VI. 



II. DERIVATION OF THE OPTIMAL POVM 

Suppose we are given a quantum system prepared 
in the state which is guaranteed to be a mem- 

ber of the set of N known non-orthogonal states 
(IV'i), |'02)j • ■ • J iV'Tv)}, but we do not know which one. 
We denote by rji the a priori probability that the system 
was prepared in the state IV'i)- We want to find a pro- 
cedure that will unambiguously assign the state of the 
quantum system to one or the other of two complemen- 
tary subsets of the set of the N given non-orthogonal 
quantum states, either {IV'i)} or {|'02), . . . , IV'Af)}- For 
unambiguous discrimination the procedure has to be 
error- free, i. e. it may fail to give us any information 
about the state, and if it fails, it must let us know that 
it has, but if it succeeds, it should never give us a wrong 
answer. Clearly, this is a variant of the unambiguous 



state discrimination problem, and we shall refer to such 
a procedure as quantum state filtering without error. We 
find that, in contrast to the unambiguous state discrim- 
ination problem, this will be possible even if is not 
linearly independent from the set {|?A2), • ■ • , IV'Jv)}- 

According to the quantum theory of measurement, the 
states cannot be discriminated perfectly if they are not 
mutually orthogonal. Thus, if we are given we will 
have some probability pi to correctly assign it to one of 
the subsets and, correspondingly, some failure probabil- 
ity, qi = \ — Pi, to obtain an inconclusive answer. The 
average probabilities of success, P, and failure, Q — l~P, 
to correctly assign the states i = 1, . . . N , are 

N 

P = ^VtPt, 

i 

Q = 51^**, (2.1) 

i 

respectively. Our objective is to find the set {qi} that 
minimizes the average probability of failure, Q, or, equiv- 
alently, the set {pi} that maximizes the average proba- 
bility of success, P. 

The procedure we shall use is a so-called "generalized 
measurement" , based on positive-operator valued mea- 
sures (POVM, 0). Using Neumark's theorem, a POVM 
can be implemented in the following way . We first em- 
bed the system in a larger Hilbert space, /C, consisting 
of the original system space, 7i, and an auxiliary Hilbert 
space called the ancilla, A. We take /C to be a tensor 
product, /C = Ti. ® A. Then we introduce an interaction 
between the system and ancilla corresponding to a uni- 
tary evolution on this larger space. The unitary evolution 
entangles the system degrees of freedom with those of the 
ancilla. Finally, a projective measurement is performed 
on the extra degrees of freedom. Due to the entangle- 
ment, a click in the ancilla detectors will also transform 
the state of the original system in a general way. We 
choose this resulting transformation of the system states 
to be the most appropriate for our filtering purposes. 

In order to accomodate N states the dimension of the 
system space H, d, need be no more than N, i.e. d < N. 
Equality holds when all of the vectors, {ipi), are linearly 
independent. We will use N as the dimensionality of H 
in the following treatment. The dimension of the ancilla, 
A, is a key point in obtaining the optimal solution and 
we will consider it next. 

The input state of the system is one of the vectors \ipi), 
which is now a vector in the subspace TC of the total space 
/C, so that 

\^^)^n = \^rM) , (2.2) 

where \(f>Q') is the initial state of the ancilla (same for all 
inputs). Following the general procedure outlined in the 
previous paragraph for the generalized measurement we 
now apply a unitary transformation, U, that entangles 
the system with the ancilla degrees of freedom. As a re- 
sult, the input vector transforms into the state \ip'^)out- 
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This state can be expanded using a basis for 
A. For the purposes of optimum unambiguous discrim- 
ination between the two sets, we want three different 
outcomes when a projective measurement is performed 
on the ancilla: one that teUs us that the input was a 
state from the first set, one that teUs us that it was from 
the second set and one that tells us that the discrimi- 
nation failed. Thus, we require the ancilla to be three- 
dimensional (fc = 1,2,3), as explained below, yielding 



= 4il^nK) + (i-4i)l^:'')k^) 



(2.3) 



In the following we drop the upper index Ti and A if it 
does not lead to confusion. We also note that the states 
[■(/ij^) and IV'"^) are not normalized. From the construc- 
tion of the output state we see that the first outcome is 
compatible with the first input, the second with an input 
state from the second set and the third outcome is com- 
patible with both inputs. We might want to require that 
IV'i) be distinguishable from j^/jj), . . . , IV'at)' yielding the 
condition. 







(2.4) 



for j = 2, . . . ,N (in general, i runs from 1 to and j 
from 2 to N). Strictly speaking, though, this condition 
is only convenient but not necessary. 

Now, a state selective measurement is performed on the 
ancilla that projects |?Af')o«t onto one of the basis vectors 
\mi) {i = 1, 2, 3). If it projects |V'j^)o„t onto |mi) or |m2), 
the procedure succeeds, because we can unambiguously 
assign the input to one or the other set. The probability 
to get this outcome, if the input state is is 



(2.5) 



If the measurement projects |V'f')o!it onto Ima), the pro- 
cedure fails because it conditionally transforms all in- 
put system states into the output that cannot be distin- 
guished. The probability of this outcome, if the input 
state is jV'i), is 



(2.6) 



From the unitarity of the transformation in Eq. I|2.3() the 
relation 



Pi 



1 



(2.7) 



immediately follows, by taking the scalar product of the 
the two sides with their adjoints. 

The nature of the problem we are trying to solve im- 
poses a number of other constraints and requirements on 
the output vectors. Let us first consider the set of system 
states associated with a click in the jms) detector, {IV"")}, 
which we also call failure vectors. If they were linearly 
independent, we could apply a further state discrimina- 
tion procedure to them [g, contrary to our assumption 



that this direction is associated with an inconclusive out- 
come. Therefore, the optimal procedure should lead to 
failure vectors to which we cannot successfully apply a 
state discrimination procedure, implying that they are 
linearly dependent. In fact, more is true and it is easy to 
show that they must be coUinear by demonstrating that 
the contrary leads to contradiction. To this end, let us 
assume that the failure vectors are not collinear. Then at 
least one of the the failure vectors, will have a com- 
ponent in the direction that is perpendicular to jf/'i ) in 
Ti. We can set up a detector in the system Hilbert space 
projecting onto this direction and a click of the detector 
will tell us that our input state was not 1-01) but one of 
the other — 1 states. Thus, contrary to our assump- 
tion that the third dimension of the ancilla is associated 
with the inconclusive outcome, further discrimination is 
possible. Hence, the failure vectors must be collinear. 
Next, we take the scalar product between l-iAi") out and 



Using Eq. (|2.3|) and the fact that U is unitary 



lead to the conditions 



WW') = (V'llV,) (j>i) 



(2.8) 



Our objective is to find the optimal j?/;") and which 
satisfy Eqs. H2.5|) - (|2.8|) and maximize the success proba- 
bility P. We shall now explore the consequences of the 
conclusion that (« = 1,---,A^) are collinear, i. e. 

the failure space, a subspace of TL, is one dimensional. If 
l^/jo) is the basis vector spanning this Hilbert space then, 
taking Eq. 1)2.6(1 into account, we can write the failure 
vectors as 



Wl) 



(2.9) 



where Xi is the phase of Substituting this represen- 
tation of the failure vectors in Eq. 12.81) ') gives 

which determines the phases for j = 2, . . . , iV. 
Taking the magnitude of Eq. (|2.1U|) , yields 

= \Wi\^o)? (.?>1) • (2.11) 

These iV — 1 conditions are a consequence of unitarity and 
imply that only one of the N failure probabilities can be 
chosen independently. If we chose qi as the independent 
one we can express the others as qj — |(V'i|V'j)P/'J'i- Let 
Oij = (V'ilV'j) then the average failure probability, Q = 
"qiqi, can be written explicitly as 



mil 



From the condition for minimum, 

^ = 0, 

dqi 

we now find the optimal value of qi, as 



<7i 



(2.12) 



(2.13) 



(2.14) 
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Inserting this value into Eq. (|2.12() finally gives 



Q 



POVM 



N 



(2.15) 



This result represents the absolute optimum for the mea- 
surement problem at hand. In the following we will in- 
vestigate its range of validity and derive the complete 
solution that is valid for all values of the parameters. 



III. LIMITATIONS OF THE POVM AND THE 
COMPLETE SOLUTION 

The value given in Eq. (|2.15|) for the minimum prob- 
ability of failure cannot always be realized. For it to 
be true, there has to exist a unitary transformation that 
takes \'4ii)in to \ij^i)out in Eq. I|2.8|l . One of the conse- 
quences of unitarity is the conservation of norm which 
is expressed by Eqs. (|2.5I) " (|2.7() . Another consequence 
is the conservation of the scalar product which we only 
partially used in Eq. (|2.8|l . Taking the scalar product of 
\'4'i)out with \ipk)out from Eq. H2.3|) leads to the general- 
ization of Eq. 12.8|l . 



(3.1) 



Obviously, k = \ and I = j > 2 reproduces Eq. (|2.8|) as 
a special case since, according to Eq. (|2.4ll . (V'llV'j) = 
for = 2, . . . , N. These equations imply that 



JiXk-Xl) 



(3.2) 



This set of equations can only be true if the matrix M, 
where 



Mik = (ipiliJk) 



pi(Xk-Xl) 



(3.3) 



is positive semidefinite, as discussed in detail in Ref. 0. 
The matrix Mik = (■0;|'0fc) has the structure 



M = 





Af^ j ' 



(3.4) 



where M" = Mu = 1 ~ qi and all other elements in 
the first row and first column are zero because of the 
condition (E3Il, and M'' = Mjj, for j, j' ^2,...,N. 

Thus, one of the positivity conditions is gi < 1 from 
the positivity of M". If qi — 1, then the average failure 
probability, which we denote by Qa, becomes 



Qa 



N 

i=2 



(3.5) 



which follows from Eq. H2.12|l with qi — 1. Note that this 
is the same average probability that we would obtain if 
we projected the state of the system we were given onto 
l^i)- 



In order to evaluate the positivity condition for 
we first express the second term on the right-hand-side 
of Eq. linSl as 



91 



(3.6) 



where we multiplied Eq. (|2.1U|) with its conjugate for j'. 
This allows us to write 



91 



(3.7) 



At this point, it is convenient to introduce the following 
notation. We call {l^/'i)} the a set and > 1} the (3 

set. Define Ha to be the one dimensional space that is the 
span of and Hfi to be the span of = 2, . . . N}. 

In addition, let Pa be the projection onto Tia, and Pp 
be the projection onto 7i^. This gives us two different 
decompositions of the system Hilbert space, In = Pa + 
Pa = Pp + Ppi where bar stands for projection onto the 
orthogonal complement. Then we can write, for j, j' > 1, 



Mn' = mPfMo') 



(V',|P^|^i)(V'i|P/3|^,-') 



91 



91 



(3.8) 



where = Pi}\i)i) is the component of 1-01) in Tip. 
This leads to a further decomposition of the TLp subspace, 

Pp^ pI+ P^ where = l^f) (^f 1/(^1 1^') and 
is the projection onto the orthogonal subspace of TLp. 

Thus, is positive semidefinite if qi > {ipl When 

9i — the failure probability is 



Qt3 = m{4\4) 



(3.9) 



This is the same failure probability that is obtained by 
projecting each quantum system we are given onto P|. 
Combining the conditions for the positivity of M" and 
we find that the POVM solution is valid if 



(^^IV-^) <9i < 1- 



(3.10) 



In view of Eq. H2.11|l this condition ensures that all fail- 
ure probabilities will be bounded by similar inequalities. 



< 



9, < 1 , 



(3.11) 



where we introduced the notation \ip^) — Pa\ijjj) for the 
component of any state from the second set in 7Y„. 

The boundaries for the validity of the POVM solution, 
Eq. (|3.1U|) (or Eq. I|3.11|l '). can be expressed in terms 
of the independent parameters of the problem. The a 
priori probability that the input state is from the a set 
is rja = Viy and the a priori probability that it is from 
the (3 set is 77^ = 1 — ria{= 1 — 771). Next, we introduce 
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the renormalized a priori probabilities, rj'j = rjj/r]/}, for 
j > 1. In terms of these renormahzed quantities we can 
write qi for the optimal POVM, Eq. H2.14|l . as 



V m 

Substitution into (|3.10|l yields upper and lower bounds 
for the a priori probability of the state to be filtered, 



5 + 1 



< 771 < 



S 



(3.13) 



with 



S: 



N 



(3.14) 



Within these bounds the POVM solution is valid. 

Summarizing H2.15|l , (|3.5|l , and (|3.9|l and taking (|3.13|) 
into account, we can write the optimal solution as 



iPOVM 



Q 



opt 



if TT^ < m < 1 — ii — 

l+S - /I - s+|(^;|^ll)|2 



Q/5 



if r]i < rjii 

if 771 > Tju 



s 

l+S 



(3.15) 



representing our main result. In the intermediate range 
of r/i the optimal failure probability, QpovM, is achieved 
by a generalized measurement or POVM. Outside this 
region, the optimal failure probabilities, Qa and Qp, are 
realized by standard von Neumann measurements, corre- 
sponding to two different orthogonal decompositions of 
Ti.. For rji < rji, I-H = Pa + Pa- A click of the Pa detec- 
tor corresponds to failure because it can have its origin 
in either of the two subsets and a click in the orthogonal 
directions uniquely assigns the input state to the /3 set. 
For rji > i]u, Ih = Pp + P^ + Pp- A click of the P| detec- 
tor corresponds to failure because it can have its origin 
in either of the two subsets, a click of the P/3 detector 
uniquely assigns the input state to the a set and a click 
of the Pp- detector uniquely assigns the input state to 
the /3 set. At the boundaries of their respective regions 
of validity, the optimal measurements transform into one 
another continuously. In its range of validity the POVM 
performs better than either one of the two possible von 
Neumann measurements. 

Finally, we want to point to an interesting feature of 
the solution. The results hold true even when the first 
input state, \tpi), lies entirely in Tip. In this case the two 
von Neumann decompositions coincide and the range of 
validity of the POVM solution shrinks to zero. A click in 
the Pa detector corresponds to failure since it can origi- 
nate from either of the two subsets and a click in one of 
the detectors along the orthogonal directions unambigu- 
ously identifies an input from the (3 set. 



IV. SET DISCRIMINATION AS 
DISCRIMINATION OF MIXED STATES 

In this section we shall establish a connection between 
quantum state filtering and the discrimination of mixed 
states. In fact, we will show that filtering is equivalent 
to the problem of discrimination between a pure state 
(a rank 1 mixed state) and an arbitrary (rank N) mixed 
state. Thus filtering can be regarded as an instance of 
mixed state discrimination. 

It is possible to express a number of the quantities in 
the solution in a more compact way. Since we do not 
want to resolve the individual states in the two sets, the 
states in a set can be given an ensemble description. To 
make the connection between the set discrimination and 
the ensemble viewpoint, we define two density matrices 



Pa = \tpi){tpi\ 

N 
J=2 



(4.1) 



where the primed quantities have been introduced in con- 
nection with Eq. H3.12|l . The a priori probabilities of 
these states are given by rja — rji and rjp = 1 — 771, re- 
spectively. Since these density matrices completely char- 
acterize the sets all results should be expressible in terms 
of them. Indeed, we have immediately that 

5= (Vi|p/3|^i) =Tr(p<,p/3) . (4.2) 

We ultimately want to find a compact expression for 
the optimal failure probabilities. We can express Qa in 
terms of p^, and Pa as 

Qa = 110+ rjp^T^iPaPp) 
= 110+ VpS 

= TlaMPaPaPa)+V(3'^r{PaPf3Pa) ■ (4.3) 

The last expression, although superfluous, makes it ex- 
plicit that in this case the measurement is a von Neumann 
projection on the one-dimensional subspace T-La- 

Similarly, we can express Q/j in terms of the density 

matrices and P| as 

n / / III /lt\ , Tr(pQp;3) 

(V^il^i) 
s 



= VaiWim) , ,11, ,|L 

wim) 

= 77„Tr(pjp„pJ)-f ,y0Tr(pjp;5Pj) . (4.4) 

The last expression, although again superfluous, makes it 
explicit that in this case the measurement is a von Neu- 
mann projection on the one-dimensional subspace 7i^. 

Finally, Q pov m can be written in terms of the density 
matrices as 



QpovM = 2-\/ rjaripS 



VaVisTripaPfi) ■ (4.5) 
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Since all of the failure probabilities can be expressed 
in terms of invariant expressions of the density matri- 
ces only, we have just shown that filtering is equivalent 
to the optimal unambiguous discrimination between a 
rank 1 mixed state (a pure state) and an arbitrary mixed 
state, providing the simplest example for discrimination 
between mixed states. 

Before leaving the realm of mixed state discrimination 
we want to point to an interesting connection to earlier 
work. We notice that the fidelity F between a pure state 
IV') and a mixed state p is given by |^ 



The optimal POVM failure probability can be written as 
QpovM = 2yfr]^F{pa,pp) . (4.7) 

This coincides with the lower bound on the optimal fail- 
ure probability found by Rudolph, et al. jllj, construc- 
tively proving that, for this case, the lower bound can be 
saturated in the range of validity of the optimal POVM. 



V. GEOMETRICAL INTERPRETATION OF 
THE OPTIMAL MEASUREMENTS 

In this Section we show that for a complete description 
of the POVM one does not need to invoke Neumark's the- 
orem. In fact, a complete description is possible without 
ever leaving the Hilbert space of the system and enlarging 
it with the ancilla degrees of freedom |^. Of course, Neu- 
mark's theorem is still useful when it comes to a physical 
implementation of the POVM. 

The POVM will have three possible measurement re- 
sults, one that corresponds to \tpi), one that corresponds 
to the f3 set and one that corresponds to failure. In 
order to describe the measurement, we introduce the 
quantum detection operators Hi, 112 and IIq, also called 
POVM elelments, corresponding the three possible mea- 
surement results. We then have that (?/'i|IIi|V'i) = pi 
(= Pa) is the probability of successfully identifying {ipi), 
(^ilIIolV'i) = <Zi (= Qa) is the probability of failing to 
identify {ipj\Il2\'ipj) = Pj is the probability of suc- 

cessfully assigning Itpj) (for j = 2, . . . , TV) to the /? set, 
and {ipjlllQlipj) = Qj is the probability of failing to assign 
For later purpose, we also introduce Pj3 = rjjPj 
and g^g — 1 — Pp. For unambiguous filtering we then re- 
quire (^,|ni|V,) = (^ilHalVi) = (for j = 2,...,N). 
We want these possibilities to be exhaustive. 



Hi + Ha + Ho = / 



(5.1) 



where / is the identity in Ti.. The probabilities are always 
real and non-negative which implies that the quantum 
detection operators are non-negative. The conditions of 
positivity and unambiguous filtering require that 



naiv-i) 



for j = 2,...N. 

In order to find the form of the POVM elements ex- 
plicitly, it is useful to define the subspace Hi to be the 
linear span of the, in general non-orthogonal but linearly 

independent, vectors, \ipi) and Note that Hi C H^ 
where _L denotes the orthogonal complement in H. The 
two POVM elements. Hi and 112, will be related to two 
different orthogonal decompositions of Hi. Indeed, the 
first of the above requirements immediately gives us the 
form of Hi. We must have 



III = ci|ei)(ei| 



(5.3) 



where |ei) is the unit vector in Hi that is orthogonal to 

\ipi)- The constant < ci < 1 remains to be determined. 

The second requirement tells us that the support of 
112 is contained in H^, the subspace orthogonal to Ha. 
We can learn more about 112 by looking at the failure 
operator which, from Eq. (|5.1() . is given as 



(5.4) 



Ho = / - Hi - n2 



This operator must be positive, and we want the failure 
probabilities to be as small as possible. For a normalized 
vector \v) e H^, we have 



{v\Uo\v) 



1 



{V\I12\V) 



(5.5) 



This will achieve the minimum value consistent with the 
positivity of IIq, which is 0, if 112 jt;) = \v). This means 
that we can express 112 as 



Tl-i = P1II2P1 + Pi 



(5.6) 



where Pi is the projection onto Hi, and Pi = I— Pi. The 
appearance of the projector Pi in the POVM element is 
a consequence of what is called the reduction theorem in 
[T^. Define [62) to be the normalized vector in Hi that is 
orthogonal to The second requirement in Eq. H5.2|) 

implies that Pi 112 Pi — C2|e2)(e2|, where < C2 < 1. 
Combining our results for the different parts of 112, we 
have that 



TI2 = C2|e2)(e2| +Pi 



(5.7) 



It is also possible to express the, as yet undetermined, 
constants ci and C2 in terms of the success or failure 
probabilities for the sets. From the definition of these 
probabilities, given at the beginning of this Section, we 
find 



ci = 



C2 



1 - gg 



5 



10 



(5.8) 



(5.2) 



Our final task is to choose ci and C2 as large as possible 
(this will minimize the failure probabilities) consistent 
with the requirement that Hq be positive. Since IIq is a 
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simple 2 by 2 matrix in Tii , the corresponding eigenvalue 
problem can be solved analytically. Non-negativity of the 
eigenvalues leads, after some tedious but straightforward 
algebra, to the condition 

QaQp ^ S = Tr{paPp) ■ (5.9) 

Note that this condition is consistent with Eq. I|2.11|l . 
Multiplying Eq. H2.11(l with -q'^ and taking the sum over 
j leads to the above condition. The task then is to find 
the minimum of the average failure probability 

Q = Vaqa + VPQP , (5.10) 

under the constraint of Eq. H5.9(l . This, once again, gives 
the solution H3.15|l . found via the Neumark approach. In 
particular, we obtain the optimum values of the failure 
probabilities as 

Qa = \ —S , qp = J — S . (5.11) 

Inserting these values in Eq. (|5.8|l gives us the explicit 
expressions for the optimal POVM elements. More im- 
portantly, the positivity conditions of ci and C2 give us 
the range of existence of the POVM solution. Obviously, 
for ci > we have to require < 1 and for C2 > we 
have to require < S/Wip^W^ . Combining these with Eq. 
(|5.9|) we obtain that the POVM solution is valid in the 
interval 

||V^f|P<9a<l, S<qp<-^ (5.12) 

which is of course identical to our earlier findings. 

From these results, it is now very easy to see what hap- 
pens at the boundaries. When qa = 1 and qjs = S we 
have ci = and C2 = 1 and the POVM degenerates into 
projective von Neumann measurements corresponding to 
the second decomposition of Hi. P1II2P1 — |e2)(e2| will 
be part of 112 for successfully identifying an input from 
from the /3 set and IIq = becomes a projector 

for failure, so the input will be missed completely. 

Conversely, when = HV'ilP and qfj = 5'/||?/'ilP we have 
ci = 1 and C2 = and the POVM degenerates into pro- 
jective von Neumann measurements corresponding to the 
first decomposition of T-Ci- Now, we have Hi = |ei)(ei| 



for successfully identifying the input as being from the 

a set and Hq = IV'i) (V"! l/(V'i IV'i) becomes a projector 
for failure. In this later case both types of input can 
be identified. Finally, we note that from these considera- 
tions it is clear that for the implementation of Neumark's 
theorem only the subspace Hi has to be entangled with 
the ancilla, giving further directions for an experimental 
realization. 



VI. CONCLUSIONS 

The usual problem considered when trying to unam- 
biguously discriminate among quantum states is to cor- 
rectly identify which state a given system is in when one 
knows the set of possible states in which it can be pre- 
pared. Here we have considered a related problem that 
can lead to further generalizations and applications in 
quantum information and quantum computing. The set 
of N possible states is divided into two subsets, and we 
only want to know to which subset the quantum state 
of our given system belongs. We considered the simplest 
instance of this problem, the situation in which we are 
trying to discriminate between a set containing one quan- 
tum state and another containing the remaining A'^ — 1 
states. A method for finding the optimal strategy for dis- 
criminating between these two sets was presented, and 
explicit analytical solutions were given. For the special 
case of iV = 3, which we treated earlier, we proposed a 
quantum optical implementation of the optimal POVM 
strategy based on linear optical devices only Since 
our original proposal the experiment has been performed, 
and the results are in perfect agreement with our theo- 
retical predictions [l^ . 

One application of these results is the development of 
novel quantum algorithms 5] . A more detailed considera- 
tion of these and related problems is left for a subsequent 
publication p^ . 

Acknowledgments 

This research was partially supported by the National 
Science Foundation (Grant Number: PHY-0139692), by 
a grant from PSC-CUNY as well as by a CUNY collab- 
orative grant. 



[1] J. A. Bergou, U. Herzog, and M. Hillery, Lect. Notes 
Phys. 649, 417-465 (Springer, Berlin, 2004). 

[2] C. H. Bennett, Phys. Rev. Lett. 68, 3121 (1992). 

[3] U. Herzog and J. Bergou, Phys. Rev. A 65, 050305(R) 
(2002). 

[4] Y. Sun, J. A. Bergou, and M. Hillery, Phys. Rev. A 66, 
032315 (2002). 

[5] J. A Bergou, U. Herzog, and M. Hillery, Phys. Rev. Lett. 
90, 257901 (2003); quant-ph/0209007. 



[6] K. Kraus, States, Effects and Operations: Fundamental 
Notions of Quantum Theory (Springer, Berlin, 1983). 

[7] M. A. Neumark, Izv. Akad. Nauk. SSSR, Ser. Mat. 4, 
277 (1940). 

[8] A. Chefles, Phys. Lett. A239, 339 (1998). 
[9] Y. Sun, M. Hillery and J. A. Bergou, Phys. Rev. A 64, 
022311 (2001). 

[10] M. A. Nielsen and L L. Chuang, Quantum Computation 
and Information (Cambridge University Press, 2000). 



8 



[11] T. Rudolph, R. W. Spekkens, and P. S. Turner, Phys. [13] M. Mohseni, A. Steinberg, and J. A. Bergou, 
Rev. A 68, 010301(R) (2003) quant-ph/0401002 Phys. Rev. Lett. 93, 200403 (2004). 

[12] Ph. Raynal, N. Liitkenhaus, and S. van Enk, Phys. Rev. [14] J. A. Bergou and M. Hillery, in preparation. 
A 68, 022308 (2003) 



